REPORT RESUMES 

ED 013 842 UD 002 975

ABILITY GROUPING--WHAT GOOD IS IT.

BY- JUSTMAN, JOSEPH

PUB DATE FEB 67

EDRS PRICE MF-$0.25 HC-$0.16 4P.

DESCRIPTORS- *ABILITY GROUPING, *HOMOGENEOUS GROUPING,
HETEROGENEOUS GROUPING, GRADE 3, GRADE 4, *ACHIEVEMENT GAINS,
READING TESTS, SPECIAL PROGRAMS, ELEMENTARY SCHOOLS,
EXPERIMENTS, STUDENTS, DATA, METROPOLITAN READING TEST

RESEARCH FINDINGS ON ABILITY GROUPING ARE INCONCLUSIVE
BECAUSE NEITHER HETEROGENEITY NOR HOMOGENEITY HAS BEEN
DEFINED WITH SUFFICIENT CLARITY. THE TENDENCY IN THESE
STUDIES HAS BEEN TO STRESS THE PERFORMANCE OF THE PUPILS IN
SUCH CLASSES RATHER THAN THE PERFORMANCE OF THE CLASS AS A
WHOLE. IN A STUDY OF 181 CLASSES (4,705 PUPILS) HOMOGENEITY
WAS MEASURED BY THE STANDARD DEVIATION OF CLASS PERFORMANCE
ON THE FIRST OF TWO METROPOLITAN READING TESTS GIVEN IN TWO
SUCCESSIVE YEARS. GROWTH WAS THEN DETERMINED BY THE
DIFFERENCES IN CLASS MEANS ON THE TWO TESTS. THE SAME
SUBJECTS WERE TESTED IN GRADES THREE AND FOUR, AND THE
CLASSES WERE DIVIDED INTO HIGH, AVERAGE, AND LOW LEVELS OF
ACHIEVEMENT AND DEGREE OF HOMOGENEITY. (A STANDARD DEVIATION
OF 6.0 THROUGH 8.9 MONTHS CHARACTERIZED "AVERAGE
HOMOGENEITY.") FINDINGS SHOW AN INCONSISTENT GROWTH
PATTERN-- (1) ON THE WORD KNOWLEDGE SUBTEST, MEAN GROWTH WAS
PRACTICALLY IDENTICAL FOR THE AVERAGE AND LOW HOMOGENEITY
CLASSES, AND (2) ON THE READING SUBTEST, THE LOW HOMOGENEITY
CLASSES SHOWED GREATER GROWTH THAN THE AVERAGE OR HIGH
CLASSES. INCONSISTENCY WAS ALSO EVIDENT WHEN VARIOUS
COMBINATIONS OF INITIAL ACHIEVEMENT LEVEL AND CLASS
HOMOGENEITY WERE ANALYZED. THEREFORE, NARROWING THE RANGE OF
ABILITY IN CLASSES DOES NOT IPSO FACTO IMPROVE PUPIL
ACHIEVEMENT. PROGRAMS DESIGNED SPECIFICALLY FOR THE SEVERAL
ABILITY LEVELS ARE NEEDED AS A CONCOMITANT OF ABILITY
GROUPING. THIS ARTICLE WAS PUBLISHED IN "THE URBAN REVIEW,"
VOLUME 2, FEBRUARY 1967. (NH)






OFFICE OF EDUCATION 



ED013842 



Ability Grouping--What Good Is It?

by Joseph Justman 



THIS DOCUMENT HAS BEEN REPRODUCED EXACTLY AS RECEIVED FROM THE
PERSON OR ORGANIZATION ORIGINATING IT. POINTS OF VIEW OR OPINIONS
STATED DO NOT NECESSARILY REPRESENT OFFICIAL OFFICE OF EDUCATION
POSITION OR POLICY.



If one were to ask an elementary school supervisor why he uses ability
grouping in organizing his school at the beginning of each year, he would
probably cite a number of reasons: pupil achievement is better, teachers find
it easier to teach classes showing a narrow range of ability, the slower
children do not become a hindrance to those who learn more readily, etc. Yet,
when the research in the field is examined, the findings are generally
inconclusive.*

To some degree, the conflicting results obtained in the scores of studies
which have been conducted over the past 40 years are understandable. In most
instances, the conditions under which the studies were conducted differed
markedly. Moreover, most of the studies in the area of ability grouping
compare the performance of pupils enrolled in “homogeneous” and
“heterogeneous” groups.

What is a “homogeneous” group? In most instances, the designation is a
convenient administrative label. It is not a generic term. Whether or not a
class is truly homogeneous depends on the spread of ability in the total
population from which the class is drawn. It is not inconceivable that a
so-called “heterogeneous” class drawn from a population with a narrow range
will actually show less variation in ability than a so-called “homogeneous”
class drawn from a broad-range population.

There is another shortcoming characteristic of the research in the field of
ability grouping. Most of the studies tend to focus their attention not on the
performance of the homogeneous or heterogeneous classes that have been
formed, but on the performance of the children enrolled in such classes. The
individual pupil, rather than the class, is the unit of analysis. The findings
of a typical study are reported in the following terms: “Pupils enrolled in
homogeneous groups, as contrasted with matched pupils enrolled in
heterogeneous groups, tend to....” Somewhere in the program of analysis, the
class has disappeared.

In view of the shortcomings noted above, there appears to be need for a
study of ability grouping in which homogeneity would be strictly defined,
and in which the class, rather than the pupil, would be the unit of analysis.
Such a study is reported below. Homogeneity is defined in terms of the
standard deviation of class performance on an initial test, and growth is
measured in terms of differences in class means on initial and final tests.
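The two definitions can be sketched in a few lines of Python. This is only an illustration under stated assumptions: scores are taken to be grade scores such as 3.9, one tenth of a grade score is read as one month, and the population form of the standard deviation is used because the article does not say which form was applied. The function names and the five sample scores are hypothetical and are not data from the study.

```python
from statistics import mean, pstdev

# Assumption: a grade score of 3.9 reads as grade 3, month 9,
# so one grade-score unit corresponds to 10 months.
MONTHS_PER_GRADE = 10


def class_homogeneity_months(initial_scores):
    """Spread of one class on the initial test, in months (population SD)."""
    return pstdev(initial_scores) * MONTHS_PER_GRADE


def class_gain_months(initial_scores, final_scores):
    """Growth of one class: final class mean minus initial class mean, in months."""
    return (mean(final_scores) - mean(initial_scores)) * MONTHS_PER_GRADE


# Hypothetical class of five pupils (illustrative values only).
initial = [3.4, 3.7, 3.9, 4.1, 4.4]
final = [4.6, 4.9, 5.0, 5.3, 5.7]

print(round(class_homogeneity_months(initial), 1))
print(round(class_gain_months(initial, final), 1))
```

Both functions take the whole class as input and return one number per class, which is the sense in which the class, rather than the pupil, serves as the unit of analysis.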



In a study conducted in the New York City schools, parallel forms of the
Metropolitan Reading Test were administered in May of two successive years
to all third-grade classes and to all fourth-grade classes in more than 75
schools. Those classes that had remained virtually intact (no more than two
pupils had left or been added to the class) over the period of one year which
had elapsed were identified. Because of mobility and of pupil absence on the
date of testing, test data were not available for both years for every pupil.
Classes in which data were not available for at least 10 pupils, and for at
least 75 per cent of the pupils on register, were dropped. These restrictions
effectively eliminated classes that were abnormally small, and classes for
which only partial data were available.
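The retention rules just described can be written as a single predicate. The sketch below is one reading of the text, not the study's actual procedure: it assumes a class had to satisfy all three conditions to be kept, and the function and parameter names are hypothetical.

```python
def keep_class(on_register, pupils_left_or_added, pupils_with_both_scores):
    """Return True if a class meets the retention rules as read from the text.

    Assumption: all three conditions must hold for a class to be retained.
    """
    # "Virtually intact": no more than two pupils left or were added.
    intact = pupils_left_or_added <= 2
    # Scores for both years available for at least 10 pupils.
    enough_pupils = pupils_with_both_scores >= 10
    # Scores for both years available for at least 75 per cent of the register.
    enough_coverage = pupils_with_both_scores >= 0.75 * on_register
    return intact and enough_pupils and enough_coverage


# Hypothetical classes (illustrative values only).
print(keep_class(on_register=30, pupils_left_or_added=2, pupils_with_both_scores=25))
print(keep_class(on_register=12, pupils_left_or_added=0, pupils_with_both_scores=9))
```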

After these restrictions had been applied, data for a total of 4,705 pupils
enrolled in 181 classes drawn from 42 schools remained available for further
analysis. These classes were divided into three groups, based on mean initial
test scores. Since the initial test was administered in May, when normal
achievement would be represented by a grade score of 3.9, all classes in which
the mean initial reading grade fell between 3.5 and 4.4 were classified as
showing average achievement. The standard deviation at the initial testing was
used to divide the classes in terms of homogeneity. A class was considered as
showing average homogeneity if the standard deviation fell in the range from
6.0 through 8.9 months.
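The two classifications can be sketched as follows. The "average" bands are the ones stated in the text. The cutoffs for the high and low bands (4.5 and over, 3.4 and below; 5.9 and below, 9.0 and over) are taken from the column and row headings of the study's Table. Rounding to one decimal place before comparing is an assumption, and the function names are hypothetical.

```python
def achievement_level(mean_initial_grade):
    """Classify a class by its mean initial grade score."""
    score = round(mean_initial_grade, 1)  # assumption: scores reported to one decimal
    if score >= 4.5:
        return "high"
    if score >= 3.5:
        return "average"
    return "low"


def homogeneity_level(sd_months):
    """Classify a class by the standard deviation of its initial scores, in months.

    A small spread means a more homogeneous class.
    """
    sd = round(sd_months, 1)  # assumption: SDs reported to one decimal
    if sd <= 5.9:
        return "high"
    if sd <= 8.9:
        return "average"
    return "low"


print(achievement_level(3.9), homogeneity_level(7.2))
```

Crossing the two labels yields the nine cells (three achievement levels by three homogeneity levels) in which the classes are tabulated.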



The Findings

A summary of the mean gains, in months, shown by the participating classes
is presented in the following Table.



Mean Gains (in Months) on the Metropolitan Reading Test, by Initial
Achievement Level and Homogeneity of Class

Test I: Word Knowledge

                                  Achievement Level of Class
Homogeneity of Class          High           Average    Low             Total
                              4.5 and over   3.5-4.4    3.4 and below

High               N          11             9          30              50
5.9 and below      Mean       20.7           11.2       9.0             12.0

Average            N          12             40         24              76
6.0-8.9            Mean       18.1           14.1       10.2            13.5

Low                N          27             26         2               55
9.0 and over       Mean       14.7           12.6       12.9            13.4

Total              N          50             75         56              181
                   Mean       16.8           13.2       9.7             13.1


Test II: Reading

                                  Achievement Level of Class
Homogeneity of Class          High           Average    Low             Total
                              4.5 and over   3.5-4.4    3.4 and below

High               N          12             17         25              54
5.9 and below      Mean       19.1           11.2       9.1             12.0

Average            N          14             38         26              78
6.0-8.9            Mean       13.4           11.0       11.2            11.5

Low                N          21             20         8               48
9.0 and over       Mean       17.1           13.9       9.5             14.0

Total              N          47             75         59              181
                   Mean       16.5           11.8       10.1            12.5


The 181 classes, taken as a group, gained 13.1 months in Word Knowledge
and 12.5 months in Reading over the one-year period between initial
and final testing. As one would expect, mean gains in achievement tended to
be positively associated with initial reading level. Classes with high initial
achievement showed greater mean growth than those with average initial
achievement; the mean growth shown by the latter, in turn, was greater than
that of classes with low initial achievement. This trend was noted on both
subsections of the achievement test.
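The trend by initial achievement can be checked against the Table by pooling cell means, weighted by the number of classes in each cell. The sketch below uses the Word Knowledge cells as printed in the Table. The dictionary layout and function names are hypothetical, and weighting by number of classes is an assumption, although it follows from treating the class as the unit of analysis.

```python
# Word Knowledge cells from the Table: (number of classes, mean gain in months),
# keyed by (homogeneity of class, initial achievement level of class).
WORD_KNOWLEDGE = {
    ("high", "high"): (11, 20.7),
    ("high", "average"): (9, 11.2),
    ("high", "low"): (30, 9.0),
    ("average", "high"): (12, 18.1),
    ("average", "average"): (40, 14.1),
    ("average", "low"): (24, 10.2),
    ("low", "high"): (27, 14.7),
    ("low", "average"): (26, 12.6),
    ("low", "low"): (2, 12.9),
}


def pooled_mean(cells):
    """Mean gain over several cells, weighting each cell by its number of classes."""
    total_classes = sum(n for n, _ in cells)
    return sum(n * gain for n, gain in cells) / total_classes


def mean_by_achievement(table, level):
    """Pooled mean gain for all classes at one initial achievement level."""
    return pooled_mean([cell for (_, ach), cell in table.items() if ach == level])


for level in ("high", "average", "low"):
    print(level, round(mean_by_achievement(WORD_KNOWLEDGE, level), 1))
print("all classes", round(pooled_mean(list(WORD_KNOWLEDGE.values())), 1))
```

Pooled this way, the cell means give the gains reported for high, average, and low initial achievement (16.8, 13.2, and 9.7 months) and the overall Word Knowledge gain of 13.1 months.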

The same generalization could not be advanced, however, when the classes
were divided into subgroups showing high, average, and low homogeneity.
When this was done, mean growth in Word Knowledge was virtually identical
for classes showing average or low homogeneity, while on the Reading
subtest, the mean growth of classes showing low homogeneity was higher than
that of classes with average or high homogeneity.

Lack of a consistent growth pattern was even more evident when various
combinations of initial achievement level and class homogeneity were
considered. For example, greater growth in Word Knowledge was shown by classes
with high initial achievement as class homogeneity increased; with classes
showing low initial achievement, however, greater growth was associated with
increasing heterogeneity. In the case of classes showing average initial
achievement, the greatest growth was noted in classes with average
homogeneity, and the least growth in classes with high homogeneity.

A similar pattern of inconsistency was noted in the Reading subsection of
the achievement test. For those classes showing high initial achievement, the
greatest mean gains were made by classes that were classified in the high
homogeneity category. In the case of classes showing average initial
achievement, the greatest gains between initial and final tests were observed
in classes with low homogeneity. For classes showing low initial achievement,
the greatest mean gains were noted in classes with average homogeneity.













Conclusions 

It is very clear that reducing the range of ability in these classes was not
associated with increased achievement in reading. The lesson for the school
administrator is equally clear: homogeneous grouping is not a panacea for
educational ills. The school administrator who looks to homogeneous grouping
as a means of improving pupil achievement will find the process of little value
unless definite programs, specifically designed for the several ability levels
into which they group their classes, are developed. Grouping by itself, without
curricular modification as a concomitant, will not give rise to the desired
outcome of improved pupil performance.

*Miriam Goldberg and others. The Effects of Ability Grouping.
New York: Teachers College Press, 1966.



Dr. Joseph Justman is acting director of the Bureau of Research for the Board of Educa- 
tion of the City of New York. 



